CDS

Accession Number TCMCG007C36101
gbkey CDS
Protein Id XP_009148548.1
Location complement(4767343..4768290)
Gene LOC103871962
GeneID 103871962
Organism Brassica rapa

Protein

Length 315aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA249065
db_source XM_009150300.3
Definition uncharacterized protein LOC103871962 [Brassica rapa]
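
Consistency check (illustrative sketch, not part of the source record): the Location field spans a single interval with no join, so its nucleotide length should be three times the protein length plus one stop codon. The helper name `cds_span_length` is hypothetical, and the parser assumes only the simple `start..end` and `complement(start..end)` forms.

```python
import re

def cds_span_length(location: str) -> int:
    """Return the nucleotide length of a single-interval feature location.

    Assumes a simple 'start..end' or 'complement(start..end)' string;
    joined (multi-exon) locations are not handled in this sketch.
    """
    m = re.fullmatch(r"(?:complement\()?(\d+)\.\.(\d+)\)?", location.strip())
    if m is None:
        raise ValueError(f"unsupported location format: {location!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1  # coordinates are 1-based and inclusive

location = "complement(4767343..4768290)"  # value of the Location field
nt_length = cds_span_length(location)
codons, remainder = divmod(nt_length, 3)
aa_length = codons - 1  # drop the terminal stop codon

print(nt_length, codons, remainder, aa_length)
```

Under these assumptions the span is 948 nt, i.e. 316 codons, which agrees with the reported Length of 315aa plus one stop codon.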

EGGNOG-MAPPER Annotation

COG_category O
Description The proteasome is a multicatalytic proteinase complex which is characterized by its ability to cleave peptides with Arg, Phe, Tyr, Leu, and Glu adjacent to the leaving group at neutral or slightly basic pH
KEGG_TC -
KEGG_Module M00340
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko01000, ko01002, ko03051
KEGG_ko ko:K02737
EC 3.4.25.1
KEGG_Pathway ko03050, map03050
GOs GO:0000502, GO:0005575, GO:0005622, GO:0005623, GO:0005839, GO:0019774, GO:0032991, GO:0044424, GO:0044464, GO:1902494, GO:1905368, GO:1905369

Sequence

CDS:  
ATGTCTCACCACCACTATGAAACCAACCCGCATTTCGTTCAGTTTTCACAGGACCACCATCCCGGTGGTCCTTCGAGCTCATGGACCTCCCCAGACCACCACCAGAACTCGCAGACTCACCCAGTTCCTCCAATTGGGCCGAAAATAAAGACTCGAGTACGCCATCAGACCGAGCCACCGGAACCAATCCATGAACCACCTTCCTCAAGACCCTTGCCACTGAGGCCAGAAGAACCTCTACCACCACGCTCTGGCAGGCCATTACTCTTAAGCCCTGAAGATCAACAACGACCTCCACACCATGGTGGCTATAAACCTGAACCAACTCCATGGTGGACCGCTCAAACACGACCAGCAGCTCATCAACCAGGTTCGAAGAGGACCGAACCCATGAAACTGACGGCTACAGTCTGCTGTGCAATTCTCCTGATCATCCTGATTCTTTCCGGTCTCATCCTCCTCCTCGTCTACCTCAGCAACCGCCCAAACACACCCTACTTCGACATCTCAGCAGCAACCTTAAACACCGCGAATCTCGACATGGGCTATTCCCTAAACGGAGACCTCGCCGTCGTGGTAAACTTCACAAACCCGAGCATGAAAAGCAACGTGGACTTCAGCTACATCATGTTCGAGCTCTTTTTTTACAACACACTCATAGCGACGGAACACATCGAGCCCTTCATTGTCCCAAAGGGAATGTCGATGTTCACCAGCTTCCATCTCGTGAGCAGTCAGGTCCCTATTGAAATGACTCAGAGCCAGGAGTTGCAGCTGCAGCTTGGAAACGGTCCTGTGTTGCTGAACCTGAGAGGAACGTTTCACGCGCGCTCGGACCTCGGGTCGTTTATGAGATACTCTTATTGGTTGCACACCCGTTGCAGCATCTCGTTGAATAGCCCTCCTTCAGGGTACATACGAGCAAGAAGATGCATTACCAGACGCTAG
Protein:  
MSHHHYETNPHFVQFSQDHHPGGPSSSWTSPDHHQNSQTHPVPPIGPKIKTRVRHQTEPPEPIHEPPSSRPLPLRPEEPLPPRSGRPLLLSPEDQQRPPHHGGYKPEPTPWWTAQTRPAAHQPGSKRTEPMKLTATVCCAILLIILILSGLILLLVYLSNRPNTPYFDISAATLNTANLDMGYSLNGDLAVVVNFTNPSMKSNVDFSYIMFELFFYNTLIATEHIEPFIVPKGMSMFTSFHLVSSQVPIEMTQSQELQLQLGNGPVLLNLRGTFHARSDLGSFMRYSYWLHTRCSISLNSPPSGYIRARRCITRR
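
Translation check (illustrative sketch, not part of the source record): the CDS above is already given in coding orientation, so translating it with the standard genetic code should reproduce the Protein sequence. The function name `translate` and the variables `cds_prefix` and `cds_suffix` are hypothetical; only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short, and the standard code (NCBI translation table 1) is assumed.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_prefix = "ATGTCTCACCACCACTATGAAACCAACCCG"  # first 30 nt of the CDS
cds_suffix = "ATTACCAGACGCTAG"                 # last 15 nt of the CDS

print(translate(cds_prefix))  # MSHHHYETNP
print(translate(cds_suffix))  # ITRR
```

The two results match the first ten and last four residues of the Protein sequence; passing the full CDS string to the same function would allow a complete comparison.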